Optimal Parallel Approximation Algorithms for Prefix Sums and Integer Sorting (Extended Abstract)

نویسندگان

  • Omer Berkman
  • Dany Breslauer
  • Zvi Galil
  • Yossi Matias
  • Baruch Schieber
  • Michael T. Goodrich
  • Richard E. Ladner
  • Philip D. MacKenzie
چکیده

Ragde. Triply-logarithmic upper and lower bounds for minimum, range minima, and related problems with integer inputs. Optimal doubly logarithmic parallel algorithms based on nding all nearest smaller values. Optimal sub-logarithmic time integer sorting on a CRCW PRAM (note). Submitted for publication, 1991. 27] Sandeep Sen. Finding an approximate median with high probability in constant parallel time. 9 Let j be the set of elements of value j, j = 1; : : :; n. The integer-chain sorting algorithm in 11, Sec. 9] consists of t = O(lg n) iterations. At iteration i, an array A i of size cn=2 i is used. For each set j an interval D i j of size d i j = O(i j) may be allocated in A i , where i j is an estimate for j j j computed in this step, and a subset i j j is injectively mapped into D i j. At the end of the t = O(lg n) iterations, each element in j is mapped into a private cell in one of the intervals D i j. Let d i j = P i k=1 d i j , and d j = d t i. The preex sums d i j i , j = 1; : : :; n, can be obtained by straightforward modiications of the integer chain sorting algorithm. After the execution of this algorithm, we can obtain the padded integer sorting sequence in an array B of size 2cn, as follows: Step 1. Allocate from B to each set j a private interval B j of size d j , so that the allocation is ordered according to j. Step 2. For each j, allocate from B j to each subset i j a private sub-interval of size d i j , so that the allocation is ordered according to i. B i j Step 3. Copy the contents of each interval D i j into interval B i j. It is easy to verify that the input elements are indeed sorted in the array B. Step 1 is implemented by the ordered allocation algorithm of Theorem 5.1. Step 2 is implemented using the preex sums sequences d i j i , j = 1; : : :; n. Step 3 is trivial. We have Theorem 5.2. (optimal padded integer sorting) The padded integer sorting problem can be solved in time …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal deterministic approximate parallel prefix sums and their applications

We show that extremely accurate approximation to the preex sums of a sequence of n integers can be computed deterministically in O(loglog n) time using O(n= loglog n) processors in the Common CRCW PRAM model. This complements randomized approximation methods obtained recently by Goodrich, Ma-tias and Vishkin and improves previous deterministic results obtained by Hagerup and Raman. Furthermore ...

متن کامل

Optimal and Sublogarithmic Time Randomized Parallel Sorting Algorithms1

We assume a parallel RAM model which allows both concurrent reads and concurrent writes of a global memory. Our main result is an optimal randomized parallel algorithm for INTEGER SORT (i.e., for sorting n integers in the range [1, n]). Our algorithm costs only logarithmic time and is the first known that is optimal: the product of its time and processor bounds is upper bounded by a linear func...

متن کامل

: Parallel Algorithms for Bucket Sorting and the Data Dependent Prefix Problem

The data dependent prefix problem is to compute all the n initial products x1⃝x2⃝...⃝xk, 1 ≤ k ≤ n, where the order is specified by a linked list. A parallel algorithm for the data dependent prefix problem is presented. This algorithm has time complexity O( n p + log n log n p ) using p processors on the exclusive-read exclusive-write computation model. A bucket sorting algorithm is also develo...

متن کامل

Optimal Parallel Algorithms for Computing the Sum, the Prefix-Sums, and the Summed Area Table on the Memory Machine Models

The main contribution of this paper is to show optimal parallel algorithms to compute the sum, the prefix-sums, and the summed area table on two memory machine models, the Discrete Memory Machine (DMM) and the Unified Memory Machine (UMM). The DMM and the UMM are theoretical parallel computing models that capture the essence of the shared memory and the global memory of GPUs. These models have ...

متن کامل

Integer Problems on Reconfigurable Meshes, with Applications*

Problems with solutions based upon integer computations are collectively referred to as integer problems. Such problems occur routinely in pattern recognition, image processing, graph theory, and query processing. The purpose of this paper is to present constant time algorithms for computing the prefix sums of an integer sequence, and for solving the packing problem on reconfigurable meshes. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994